Search CORE

Springer - Publisher Connector

F2C2: a fast tool for the computation of flux coupling in genome-scale metabolic networks

Author: Bockmayr A.
David L.
Larhlimi A.
Selbig J.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

Background: Flux coupling analysis (FCA) has become a useful tool in the constraint-based analysis of genome-scale metabolic networks. FCA allows detecting dependencies between reaction fluxes of metabolic networks at steady-state. On the one hand, this can help in the curation of reconstructed metabolic networks by verifying whether the coupling between reactions is in agreement with the experimental findings. On the other hand, FCA can aid in defining intervention strategies to knock out target reactions. Results: We present a new method F2C2 for FCA, which is orders of magnitude faster than previous approaches. As a consequence, FCA of genome-scale metabolic networks can now be performed in a routine manner. Conclusions: We propose F2C2 as a fast tool for the computation of flux coupling in genome-scale metabolic networks. F2C2 is freely available for non-commercial use at https://sourceforge.net/projects/f2c2/files/

Haplotype inference from unphased SNP data in heterozygous polyploids based on SAT

Author: Achenbach U.
Basekow R.
Diehl S.
Gebhardt C.
Gyetvai G.
Kersten B.
Neigenfind J.
Selbig J.
Publication venue
Publication date
Field of study

pcaMethods - a bioconductor package providing PCA methods for incomplete data

Author: Redestig H.
Scholz M.
Selbig J.
Stacklies W.
Walther D.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2007
Field of study

pcaMethods is a Bioconductor compliant library for computing principal component analysis (PCA) on incomplete data sets. The results can be analyzed directly or used to estimate missing values to enable the use of missing value sensitive statistical methods. The package was mainly developed with microarray and metabolite data sets in mind, but can be applied to any other incomplete data set as well

arXiv.org e-Print Archive

Structural Kinetic Modeling of Metabolic Networks

Author: B. Blasius
Bailey
Bier
Gagneur
Gross
Hynne
J. Selbig
Kell
Morohashi
Palsson
Poolman
R. Steuer
Rosenfeld
Ryde-Pettersson
Schuster
Stelling
Stephanopoulos
T. Gross
Tomita
Wolf
Publication venue: 'Proceedings of the National Academy of Sciences'
Publication date: 01/01/2006
Field of study

To develop and investigate detailed mathematical models of cellular metabolic processes is one of the primary challenges in systems biology. However, despite considerable advance in the topological analysis of metabolic networks, explicit kinetic modeling based on differential equations is still often severely hampered by inadequate knowledge of the enzyme-kinetic rate laws and their associated parameter values. Here we propose a method that aims to give a detailed and quantitative account of the dynamical capabilities of metabolic systems, without requiring any explicit information about the particular functional form of the rate equations. Our approach is based on constructing a local linear model at each point in parameter space, such that each element of the model is either directly experimentally accessible, or amenable to a straightforward biochemical interpretation. This ensemble of local linear models, encompassing all possible explicit kinetic models, then allows for a systematic statistical exploration of the comprehensive parameter space. The method is applied to two paradigmatic examples: The glycolytic pathway of yeast and a realistic-scale representation of the photosynthetic Calvin cycle.Comment: 14 pages, 8 figures (color

Explore Bristol Research

BMC Bioinformatics

Author: Black G.
Jacobsen M.
Kaufmann S.
Kern S.
Parida S.
Repsilber D.
Selbig J.
Telaar A.
Walzl G.
Publication venue
Publication date: 14/01/2010
Field of study

Background: For heterogeneous tissues, such as blood, measurements of gene expression are confounded by relative proportions of cell types involved. Conclusions have to rely on estimation of gene expression signals for homogeneous cell populations, e.g. by applying micro-dissection, fluorescence activated cell sorting, or in-silico deconfounding. We studied feasibility and validity of a non-negative matrix decomposition algorithm using experimental gene expression data for blood and sorted cells from the same donor samples. Our objective was to optimize the algorithm regarding detection of differentially expressed genes and to enable its use for classification in the difficult scenario of reversely regulated genes. This would be of importance for the identification of candidate biomarkers in heterogeneous tissues. Results: Experimental data and simulation studies involving noise parameters estimated from these data revealed that for valid detection of differential gene expression, quantile normalization and use of non-log data are optimal. We demonstrate the feasibility of predicting proportions of constituting cell types from gene expression data of single samples, as a prerequisite for a deconfounding-based classification approach. Classification cross-validation errors with and without using deconfounding results are reported as well as sample-size dependencies. Implementation of the algorithm, simulation and analysis scripts are available. Conclusions: The deconfounding algorithm without decorrelation using quantile normalization on non-log data is proposed for biomarkers that are difficult to detect, and for cases where confounding by varying proportions of cell types is the suspected reason. In this case, a deconfounding ranking approach can be used as a powerful alternative to, or complement of, other statistical learning approaches to define candidate biomarkers for molecular diagnosis and prediction in biomedicine, in realistically noisy conditions and with moderate sample sizes

7T MRI in natalizumab-associated PML and ongoing MS disease activity: a case study

Author: Faiss J.
Kern P.
Kunkel A.
Kühl M.
Mekle R.
Niendorf T.
Othman J.
Paul F.
Selbig I.
Sinnecker T.
Wienecke P.
Wuerfel J.
Publication venue: 'Ovid Technologies (Wolters Kluwer Health)'
Publication date: 29/10/2015
Field of study

OBJECTIVE: To assess the ability of ultra-high-field MRI to distinguish early progressive multifocal leukoencephalopathy (PML) from multiple sclerosis (MS) lesions in a rare case of simultaneous presentation of natalizumab-associated PML and ongoing MS activity. METHODS: Advanced neuroimaging including 1.5T, 3T, and 7T MRI with a spatial resolution of up to 0.08 mm(3) was performed. RESULTS: 7T MRI differentiated between PML-related and MS-related brain damage in vivo. Ring-enhancing MS plaques displayed a central vein, whereas confluent PML lesions were preceded by punctate or milky way-like T2 lesions. CONCLUSIONS: Given the importance of early diagnosis of treatment-associated PML, future systematic studies are warranted to assess the value of highly resolving MRI in differentiating between early PML- and MS-induced brain parenchymal lesions

MDC Repository

Metabolite fingerprinting: detecting biological features by independent component analysis

Author: A. Sterling
J. Selbig
M. Scholz
O. Fiehn
S. Gatzek
Publication venue: 'Oxford University Press (OUP)'
Publication date
Field of study

PhosPhAt: a database of phosphorylation sites in Arabidopsis thaliana and a plant-specific phosphorylation site predictor

Author: Ashburner
Blom
Chung
D. Walther
de la Fuente van Bentem
Forde
Hrabak
Huala
Hummel
J. Hummel
J. L. Heazlewood
J. Selbig
Olsen
P. Durek
Pawson
Pawson
Perkins
Tobin
W. Weckwerth
W. X. Schulze
Weckwerth
Yaffe
Publication venue: Oxford University Press
Publication date: 01/01/2008
Field of study

The PhosPhAt database provides a resource consolidating our current knowledge of mass spectrometry-based identified phosphorylation sites in Arabidopsis and combines it with phosphorylation site prediction specifically trained on experimentally identified Arabidopsis phosphorylation motifs. The database currently contains 1187 unique tryptic peptide sequences encompassing 1053 Arabidopsis proteins. Among the characterized phosphorylation sites, there are over 1000 with unambiguous site assignments, and nearly 500 for which the precise phosphorylation site could not be determined. The database is searchable by protein accession number, physical peptide characteristics, as well as by experimental conditions (tissue sampled, phosphopeptide enrichment method). For each protein, a phosphorylation site overview is presented in tabular form with detailed information on each identified phosphopeptide. We have utilized a set of 802 experimentally validated serine phosphorylation sites to develop a method for prediction of serine phosphorylation (pSer) in Arabidopsis. An analysis of the current annotated Arabidopsis proteome yielded in 27 782 predicted phosphoserine sites distributed across 17 035 proteins. These prediction results are summarized graphically in the database together with the experimental phosphorylation sites in a whole sequence context. The Arabidopsis Protein Phosphorylation Site Database (PhosPhAt) provides a valuable resource to the plant science community and can be accessed through the following link http://phosphat.mpimp-golm.mpg.d

Simultaneous alignment and folding of protein sequences

Author: A. Caprara
B.E. Shakhnovich
C.B. Do
C.B. Do
D. Frishman
D. Sankoff
D.H. Mathews
G. Raghava
I.L. Hofacker
J. Selbig
J. Waldispuhl
J. Waldispuhl
J.H. Havgaard
L.R. Forrest
M. Brudno
M. Cline
M. Lomize
M. Menke
P. Bradley
P. Fariselli
P. Rice
R. Backofen
R. Doolittle
R.A. Sutormin
R.C. Edgar
R.C. Edgar
R.C. Edgar
R.L.J. Dunbrack
S. Henikoff
S. Will
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

Accurate comparative analysis tools for low-homology proteins remains a difficult challenge in computational biology, especially sequence alignment and consensus folding problems. We presentpartiFold-Align, the first algorithm for simultaneous alignment and consensus folding of unaligned protein sequences; the algorithm’s complexity is polynomial in time and space. Algorithmically,partiFold-Align exploits sparsity in the set of super-secondary structure pairings and alignment candidates to achieve an effectively cubic running time for simultaneous pairwise alignment and folding. We demonstrate the efficacy of these techniques on transmembrane β-barrel proteins, an important yet difficult class of proteins with few known three-dimensional structures. Testing against structurally derived sequence alignments,partiFold-Align significantly outperforms state-of-the-art pairwise sequence alignment tools in the most difficult low sequence homology case and improves secondary structure prediction where current approaches fail. Importantly, partiFold-Align requires no prior training. These general techniques are widely applicable to many more protein families. partiFold-Align is available at http://partiFold.csail.mit.edu

CiteSeerX

DSpace@MIT